Fix BigInteger char parsing with Ukrainian culture NBSP handling (bidirectional) by Copilot · Pull Request #123783 · dotnet/runtime

Copilot · 2026-01-30T01:56:02Z

Fix for BigInteger char parsing with Ukrainian culture NBSP handling (bidirectional)

Fixes #120283

Issue

BigInteger parsing fails with cultures that use NBSP (U+00A0) as the NumberGroupSeparator (such as Ukrainian, uk-UA) when users type a regular space (U+0020).

Solution

Modified the MatchChars function in Number.Parsing.Common.cs to handle bidirectional space-replacing character equivalence:

Added NormalizeSpaceReplacingChar helper that converts NBSP/narrow NBSP to regular space
Simplified comparison: NormalizeSpaceReplacingChar(cp) != NormalizeSpaceReplacingChar(val)
This handles both NBSP→space and space→NBSP equivalence for char (UTF-16) parsing
Much more readable than the previous complex boolean expression
Updated comments to reference Ukrainian culture alongside French and Kazakh
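
A minimal sketch of the comparison described above, written as a small standalone program rather than the actual code in Number.Parsing.Common.cs. The helper names come from this description; their bodies, and the `CharsMatch` wrapper, are assumptions made for illustration.

```csharp
using System;

// Assumed body: the description only names NBSP (U+00A0) and narrow NBSP (U+202F).
static bool IsSpaceReplacingChar(uint c) => c == 0x00A0 || c == 0x202F;

// Maps NBSP and narrow NBSP to a regular space (U+0020); every other value is unchanged.
static uint NormalizeSpaceReplacingChar(uint c) => IsSpaceReplacingChar(c) ? 0x0020u : c;

// Hypothetical wrapper: two code units match when they are equal after normalization.
static bool CharsMatch(uint cp, uint val) =>
    NormalizeSpaceReplacingChar(cp) == NormalizeSpaceReplacingChar(val);

Console.WriteLine(CharsMatch(' ', '\u00A0'));  // input space, format NBSP: True
Console.WriteLine(CharsMatch('\u00A0', ' '));  // input NBSP, format space: True
Console.WriteLine(CharsMatch('\u202F', ' '));  // input narrow NBSP, format space: True
Console.WriteLine(CharsMatch(',', ' '));       // unrelated characters: False
```

Normalizing both sides is what makes the check symmetric: neither operand is treated as "the format" or "the input".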

Testing

Added ParseWithNBSPAsGroupSeparator() test in existing parse.cs file
Test creates a custom culture (en-US base) with NBSP explicitly set as NumberGroupSeparator
Test validates that regular spaces ("1 234 567") are accepted when culture expects NBSP
Also tests trailing space handling
This approach is portable and doesn't depend on Ukrainian culture data which may vary across systems/ICU versions
Tests validate the bidirectional space equivalence fix for char (UTF-16) parsing ✓
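
A rough sketch of what such a test might exercise. This is not the test added in parse.cs. As an assumption, it clones the invariant culture instead of en-US so that it does not depend on installed culture data, and it uses `TryParse` for the regular-space case because the outcome depends on the runtime build in use.

```csharp
using System;
using System.Globalization;
using System.Numerics;

// Assumption: invariant culture as the base (the PR's test uses an en-US base).
var culture = (CultureInfo)CultureInfo.InvariantCulture.Clone();
culture.NumberFormat.NumberGroupSeparator = "\u00A0"; // NBSP as group separator

// Control case: the input uses the same NBSP separator as the culture.
BigInteger exact = BigInteger.Parse("1\u00A0234\u00A0567", NumberStyles.AllowThousands, culture);
Console.WriteLine(exact);

// Equivalence case from the description: the input uses regular spaces.
bool ok = BigInteger.TryParse("1 234 567", NumberStyles.AllowThousands, culture, out BigInteger viaSpaces);
Console.WriteLine($"parsed with regular spaces: {ok}, value: {viaSpaces}");
```

Setting the separator explicitly on a cloned culture keeps the sketch independent of whatever ICU reports for uk-UA on a given machine.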

Impact

This fix only impacts BigInteger parsing. Other numeric types (Complex, double, float, decimal) use different parsing infrastructure in System.Private.CoreLib that doesn't include the common parsing file.

Known Limitations

This fix addresses char (UTF-16) parsing only. UTF-8 parsing issues are tracked separately in #124016.

Progress

Implement bidirectional space equivalence for char (UTF-16)
Add portable tests using custom culture in existing parse.cs file
Simplify design based on code review feedback
Optimize condition ordering for performance
Refactor to helper function for better readability
Make tests portable across different ICU versions
Fix test to actually include spaces to test space equivalence

Co-authored-by: artl93 <9855795+artl93@users.noreply.github.com>

artl93 · 2026-01-31T00:58:48Z

@copilot

Consider your design. Why is it correct and the simplest form of the solution? Are there other possibilities? I believe you considered the following. Why did you pick something different?

        private static unsafe TChar* MatchChars<TChar>(TChar* p, TChar* pEnd, ReadOnlySpan<TChar> value)
            where TChar : unmanaged, IUtfChar<TChar>
        {
            Debug.Assert((p != null) && (pEnd != null) && (p <= pEnd));

            fixed (TChar* stringPointer = &MemoryMarshal.GetReference(value))
            {
                TChar* str = stringPointer;

                if (TChar.CastToUInt32(*str) != '\0')
                {
                    // We only hurt the failure case
                    // This fix is for French, Kazakh, and Ukrainian cultures. Since a user cannot type 0xA0 or 0x202F as a
                    // space character we use 0x20 space character instead to mean the same.
                    // We also need to handle the reverse case where the input has 0xA0 or 0x202F and the format string has 0x20.
                    while (true)
                    {
                        uint cp = (p < pEnd) ? TChar.CastToUInt32(*p) : '\0';
                        uint val = TChar.CastToUInt32(*str);

                        if ((cp != val) && !((IsSpaceReplacingChar(val) && (cp == '\u0020')) || (IsSpaceReplacingChar(cp) && (val == '\u0020'))))
                        {
                            break;
                        }

                        p++;
                        str++;

                        if (TChar.CastToUInt32(*str) == '\0')
                        {
                            return p;
                        }
                    }
                }
            }

            return null;
        }
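
For readers who want to try the condition from the snippet above without unsafe code, here is a hedged span-based sketch. `MatchCharsSafe` is a hypothetical name, it works on `char` only, and it relies on span lengths instead of the `'\0'` terminator the original checks for, so it is an illustration of the comparison rather than a drop-in replacement.

```csharp
using System;

// Assumed body, based on the code points named in the snippet's comment.
static bool IsSpaceReplacingChar(char c) => c == '\u00A0' || c == '\u202F';

// Hypothetical safe counterpart of MatchChars: returns the index just past the match, or -1.
static int MatchCharsSafe(ReadOnlySpan<char> input, ReadOnlySpan<char> value)
{
    if (value.IsEmpty || input.Length < value.Length)
    {
        return -1;
    }

    for (int i = 0; i < value.Length; i++)
    {
        char cp = input[i];
        char val = value[i];

        // Same condition as the quoted snippet: exact match, or space <-> NBSP in either direction.
        if ((cp != val) && !((IsSpaceReplacingChar(val) && (cp == ' ')) || (IsSpaceReplacingChar(cp) && (val == ' '))))
        {
            return -1;
        }
    }

    return value.Length;
}

Console.WriteLine(MatchCharsSafe(" 234", "\u00A0"));  // 1: input space, separator NBSP
Console.WriteLine(MatchCharsSafe("\u00A0234", " "));  // 1: input NBSP, separator space
Console.WriteLine(MatchCharsSafe(",234", "\u00A0"));  // -1: no match
```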

Copilot · 2026-01-31T01:24:29Z

@copilot

Consider your design. Why is it correct and the simplest form of the solution? Are there other possibilities? I believe you considered the following. Why did you pick something different...

Thank you for the question! I tested your simpler solution (commit 9f5c3ae5) and you're right that it's more elegant. However, both approaches have the same limitation:

The core issue: IsSpaceReplacingChar checks Unicode codepoints (0xA0, 0x202F), but UTF-8 encodes NBSP as TWO bytes (0xC2 0xA0). When TChar is byte:

Pattern contains [0xC2, 0xA0] (UTF-8 NBSP)
Input contains [0x20] (space)
First byte comparison: IsSpaceReplacingChar(0xC2) returns false (0xC2 ≠ 0xA0)
Match fails

My complex UTF-8 handling attempted to fix this but introduced bugs with pointer advancement. Your simpler bidirectional check works perfectly for char (UTF-16) but not for UTF-8, which is why the original outer loop test still fails with both approaches.
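
The byte-level mismatch described above can be reproduced with a short sketch. The `IsSpaceReplacingChar` body here is an assumption based on the code points named in this comment; the encoding calls are standard `System.Text` APIs.

```csharp
using System;
using System.Text;

// Assumed body: checks code points, not UTF-8 byte sequences.
static bool IsSpaceReplacingChar(uint c) => c == 0x00A0 || c == 0x202F;

byte[] nbspUtf8 = Encoding.UTF8.GetBytes("\u00A0");
byte[] narrowNbspUtf8 = Encoding.UTF8.GetBytes("\u202F");
Console.WriteLine(BitConverter.ToString(nbspUtf8));        // C2-A0
Console.WriteLine(BitConverter.ToString(narrowNbspUtf8));  // E2-80-AF

// A byte-by-byte matcher sees the lead byte first, and 0xC2 is not a space-replacing code point.
bool firstByteRecognized = IsSpaceReplacingChar(nbspUtf8[0]);
Console.WriteLine(firstByteRecognized);                    // False
```

Because a one-byte space has to match a two- or three-byte separator, a per-element comparison cannot express the equivalence for `byte` input without also adjusting how far each pointer advances.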

The simpler solution is correct and I've adopted it (removes ~70 lines of buggy code). The remaining UTF-8 issue needs a different approach - likely at the point where NumberGroupSeparatorTChar<TChar>() returns the separator, it should handle the space equivalence transformation rather than in MatchChars.

artl93

@copilot - a simple test should be added to the existing parse test file, not placed in its own file. Please extend the existing tests instead of adding new ones in a separate file.

Co-authored-by: artl93 <9855795+artl93@users.noreply.github.com>

Copilot · 2026-01-31T01:39:24Z

@copilot - a simple test should be added to the existing parse test file, not placed in its own file. Please extend the existing tests instead of adding new ones in a separate file.

Done! I've moved the two Ukrainian culture tests into the existing parse.cs file (as [Fact] tests at the end of the parseTest class) and removed the separate parse.ukUA.cs file. Commit ee8019f.

artl93 · 2026-02-10T02:10:00Z

/azp run runtime-libraries-coreclr outerloop

azure-pipelines · 2026-02-10T02:10:16Z

Azure Pipelines successfully started running 1 pipeline(s).

tarekgh · 2026-02-10T18:33:53Z

@copilot is this failure related to the change here?

{ "HelixJobId": "bd8a8479-7771-47b4-bfb6-9b29c8662936", "HelixWorkItemName": "System.Runtime.Numerics.Tests" }

Error message
System.FormatException : The value could not be parsed.

Stack trace
   at System.Number.ThrowOverflowOrFormatException(ParsingStatus status) in /_/src/libraries/System.Runtime.Numerics/src/System/Number.BigInteger.cs:line 29
   at System.Number.ParseBigInteger[TChar](ReadOnlySpan`1 value, NumberStyles style, NumberFormatInfo info) in /_/src/libraries/System.Runtime.Numerics/src/System/Number.BigInteger.cs:line 136
   at System.Numerics.BigInteger.Parse(ReadOnlySpan`1 utf8Text, NumberStyles style, IFormatProvider provider) in /_/src/libraries/System.Runtime.Numerics/src/System/Numerics/BigInteger.cs:line 707
   at System.Numerics.Tests.parseTest.VerifyParseSpanToString(String num1, NumberStyles ns, Boolean failureNotExpected, String expected) in /_/src/libraries/System.Runtime.Numerics/tests/BigInteger/parse.cs:line 828
   at System.Numerics.Tests.parseTest.VerifyParseToString(String num1, NumberStyles ns, Boolean failureNotExpected, String expected) in /_/src/libraries/System.Runtime.Numerics/tests/BigInteger/parse.cs:line 884
   at System.Numerics.Tests.parseTest.VerifyParseToString(String num1, NumberStyles ns, Boolean failureNotExpected) in /_/src/libraries/System.Runtime.Numerics/tests/BigInteger/parse.cs:line 818
   at System.Numerics.Tests.parseTest.VerifyNumberStyles(NumberStyles ns, Random random) in /_/src/libraries/System.Runtime.Numerics/tests/BigInteger/parse.cs:line 689
   at System.Numerics.Tests.parseTest.RunParseToStringTests(CultureInfo culture) in /_/src/libraries/System.Runtime.Numerics/tests/BigInteger/parse.cs:line 52
   at InvokeStub_parseTest.RunParseToStringTests(Object, Span`1)
   at System.Reflection.MethodBaseInvoker.InvokeWithOneArg(Object obj, BindingFlags invokeAttr, Binder binder, Object[] parameters, CultureInfo culture) in /_/src/libraries/System.Private.CoreLib/src/System/Reflection/MethodBaseInvoker.cs:line 95

Copilot · 2026-02-10T18:37:47Z

@copilot is this failure related to the change here?

...

This is the original issue that the PR partially addresses. The failure is in UTF-8 parsing (line 828: BigInteger.Parse(utf8Num1, ns)), which is the known limitation documented in the PR description. My fix only addresses char (UTF-16) parsing - the bidirectional equivalence check works for char but not for UTF-8 because IsSpaceReplacingChar operates on Unicode codepoints (0xA0) while UTF-8 encodes NBSP as two bytes (0xC2 0xA0). UTF-8 parsing is tracked in #124016 and requires architectural changes beyond this PR's scope.

artl93 · 2026-02-10T18:57:30Z

Forgot to check results - yes. It's related.

src/libraries/System.Runtime.Numerics/tests/BigInteger/parse.cs

src/libraries/Common/src/System/Number.Parsing.Common.cs

Co-authored-by: stephentoub <2642209+stephentoub@users.noreply.github.com>

stephentoub · 2026-02-11T19:35:25Z

🤖 Copilot Code Review — PR #123783

Holistic Assessment

Motivation: The PR addresses a real test failure with Ukrainian culture (uk-UA) parsing in BigInteger where the test generates numbers with regular spaces but the culture's NumberGroupSeparator is NBSP (U+00A0). The linked issue #120283 confirms this is a genuine problem blocking CI.

Approach: The fix normalizes both input characters and format characters to a common space (U+0020) before comparison, making the equivalence bidirectional. This is a reasonable approach that aligns with how DateTimeParse.cs handles the same problem.

Summary: ⚠️ Needs Human Review. The fix is functionally correct and addresses the reported issue, but there is a performance concern with the implementation that warrants maintainer attention before merging.

Detailed Findings

⚠️ Performance Regression in Hot Path — Should Consider Restoring Fast Path

The change removes the implicit fast-path for identical characters that existed in the original code.

Original code:

if ((cp != val) && !(IsSpaceReplacingChar(val) && (cp == '\u0020')))

When cp == val (the common case for most cultures), this short-circuits after 1 comparison.

New code:

if (NormalizeSpaceReplacingChar(cp) != NormalizeSpaceReplacingChar(val))

This unconditionally calls NormalizeSpaceReplacingChar twice, and each call invokes IsSpaceReplacingChar (2 comparisons). So the common success path now costs ~5 operations instead of 1.

The existing comment "We only hurt the failure case" was intentional—the original design preserved fast-path performance. Consider restoring it:

if (cp != val && NormalizeSpaceReplacingChar(cp) != NormalizeSpaceReplacingChar(val))
{
    break;
}

This preserves the exact-match fast path while still handling the bidirectional NBSP/space equivalence correctly.
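
A small sketch, under the same assumed helper bodies as elsewhere in this thread, that checks the two conditions agree on a handful of sample characters. `MismatchAlwaysNormalize` and `MismatchWithFastPath` are hypothetical names for the two forms quoted above.

```csharp
using System;

static bool IsSpaceReplacingChar(uint c) => c == 0x00A0 || c == 0x202F;
static uint NormalizeSpaceReplacingChar(uint c) => IsSpaceReplacingChar(c) ? 0x0020u : c;

// Mismatch test without the fast path: always normalizes both sides.
static bool MismatchAlwaysNormalize(uint cp, uint val) =>
    NormalizeSpaceReplacingChar(cp) != NormalizeSpaceReplacingChar(val);

// Mismatch test with the fast path: identical code units short-circuit on `cp != val`.
static bool MismatchWithFastPath(uint cp, uint val) =>
    cp != val && NormalizeSpaceReplacingChar(cp) != NormalizeSpaceReplacingChar(val);

uint[] samples = { ' ', '\u00A0', '\u202F', ',', '.', '1' };
bool sameResult = true;
foreach (uint cp in samples)
{
    foreach (uint val in samples)
    {
        sameResult &= MismatchAlwaysNormalize(cp, val) == MismatchWithFastPath(cp, val);
    }
}

Console.WriteLine(sameResult); // True
```

The two forms agree because equal inputs always normalize to equal outputs, so the extra `cp != val` test can only skip work, never change the answer.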

Flagged by: Gemini, primary reviewer

✅ Correctness — Bidirectional Normalization is Correct

The bidirectional normalization logic is sound:

NBSP (U+00A0) in input matches Space (U+0020) in format ✓
Space (U+0020) in input matches NBSP (U+00A0) in format ✓
Narrow NBSP (U+202F) handled symmetrically ✓

This correctly fixes the reported uk-UA test failures.

✅ Test Coverage — Adequate

The new ParseWithNBSPAsGroupSeparator test covers the key permutations:

Culture with NBSP separator, input with regular spaces
Culture with regular space separator, input with NBSP
Culture with regular space separator, input with narrow NBSP (U+202F)

Using a custom culture based on en-US with explicit NumberGroupSeparator makes the test portable across different ICU versions.

💡 Comment Accuracy — Minor Suggestion

The existing comment "We only hurt the failure case" should be updated if the fast-path optimization is not restored, since it would no longer be accurate. If the fast-path is restored per the suggestion above, the comment remains correct.

Cross-Cutting Analysis

Scope verification: The PR correctly identifies that this fix only affects System.Runtime.Numerics (BigInteger, Complex). The shared file Number.Parsing.Common.cs is not used by System.Private.CoreLib's parsing code—I verified CoreLib has its own separate Number.Parsing.cs that doesn't use MatchChars.
DateTimeParse comparison: The approach aligns with how DateTimeParse.cs handles the same NBSP issue (lines 5722-5723), though that implementation is slightly different (one-directional: only input NBSP matches format space).
UTF-8 parsing: The PR correctly notes this is for char (UTF-16) parsing only. If UTF-8 has the same issue, that's tracked separately.

Models contributing to this review: Claude Sonnet 4, Google Gemini 3 Pro

src/libraries/Common/src/System/Number.Parsing.Common.cs

artl93 · 2026-02-11T22:51:47Z

/azp run runtime-libraries-coreclr outerloop

azure-pipelines · 2026-02-11T22:52:03Z

Azure Pipelines successfully started running 1 pipeline(s).

Initial plan

d8192f4

Copilot AI assigned Copilot and artl93 Jan 30, 2026

Copilot started work on behalf of artl93 January 30, 2026 01:56

Copilot AI and others added 4 commits January 30, 2026 02:00

Fix BigInteger UTF-8 parsing with Ukrainian culture and add tests

cef5ce0

Co-authored-by: artl93 <9855795+artl93@users.noreply.github.com>

Improve UTF-8 handling for space-replacing characters in MatchChars

31cf78e

Co-authored-by: artl93 <9855795+artl93@users.noreply.github.com>

Simplify Ukrainian culture tests - basic functionality working

eb6161f

Co-authored-by: artl93 <9855795+artl93@users.noreply.github.com>

Address code review feedback - update comments and tests

1066da7

Co-authored-by: artl93 <9855795+artl93@users.noreply.github.com>

Copilot AI changed the title ~~[WIP] Fix System.Numerics.Tests parseTest failures for culture uk-UA~~ Fix BigInteger UTF-8 parsing with Ukrainian culture NBSP handling Jan 30, 2026

Copilot AI requested a review from artl93 January 30, 2026 02:43

Copilot finished work on behalf of artl93 January 30, 2026 02:43

Copilot started work on behalf of artl93 January 31, 2026 00:59

Simplify to bidirectional space equivalence check (remove complex UTF-8 handling)

71f03ee

Co-authored-by: artl93 <9855795+artl93@users.noreply.github.com>

Copilot AI changed the title ~~Fix BigInteger UTF-8 parsing with Ukrainian culture NBSP handling~~ Fix BigInteger char parsing with Ukrainian culture NBSP handling (bidirectional) Jan 31, 2026

Copilot finished work on behalf of artl93 January 31, 2026 01:25

artl93 requested changes Jan 31, 2026

Copilot started work on behalf of artl93 January 31, 2026 01:32

Move Ukrainian culture tests into existing parse.cs file

ee8019f

Co-authored-by: artl93 <9855795+artl93@users.noreply.github.com>

Copilot AI requested a review from artl93 January 31, 2026 01:40

Copilot finished work on behalf of artl93 January 31, 2026 01:40

This was referenced Jan 31, 2026

[Wasm] The request was canceled due to the configured HttpClient.Timeout of 100 seconds elapsing #123572

Open

XHarness package install failure on iOS due to devicectl NSPOSIXErrorDomain error 49 #123796

Open

Copilot finished work on behalf of tarekgh February 5, 2026 17:19

Test all three MatchChars equivalence branches instead of unrelated AllowTrailingWhite

c8c519e

build-analysis bot mentioned this pull request Feb 10, 2026

Unable to pull image from mcr.microsoft.com #117164

Open

artl93 approved these changes Feb 10, 2026

Copilot started work on behalf of tarekgh February 10, 2026 18:34

Copilot finished work on behalf of tarekgh February 10, 2026 18:39

Copilot started work on behalf of artl93 February 10, 2026 18:58

Copilot AI requested a review from artl93 February 10, 2026 19:03

Copilot finished work on behalf of artl93 February 10, 2026 19:04

stephentoub reviewed Feb 10, 2026

src/libraries/System.Runtime.Numerics/tests/BigInteger/parse.cs (resolved)

src/libraries/Common/src/System/Number.Parsing.Common.cs (outdated, resolved)

stephentoub approved these changes Feb 10, 2026

Copilot started work on behalf of stephentoub February 10, 2026 20:45

Refactor to NormalizeSpaceReplacingChar helper for better readability

fb34cb1

Co-authored-by: stephentoub <2642209+stephentoub@users.noreply.github.com>

Copilot AI requested a review from stephentoub February 10, 2026 21:17

Copilot finished work on behalf of stephentoub February 10, 2026 21:17

stephentoub reviewed Feb 11, 2026

src/libraries/Common/src/System/Number.Parsing.Common.cs (outdated, resolved)

Apply suggestion from @stephentoub

bfc2c9b

stephentoub approved these changes Feb 11, 2026

build-analysis bot mentioned this pull request Feb 12, 2026

System.Security.Cryptography.CryptographicException : m_safeCertContext is an invalid handle. #124279

Closed

artl93 merged commit c667ecc into main Feb 12, 2026
83 of 98 checks passed

artl93 deleted the copilot/fix-parse-test-uk-ua branch February 12, 2026 06:10

dotnet-maestro bot mentioned this pull request Feb 13, 2026

[main] Source code updates from dotnet/runtime dotnet/dotnet#4839

Merged
